Information processing apparatus

ABSTRACT

To make it easier to switch to video that is being viewed by another user, a source data acquisition unit acquires source data of content that is distributed by an apparatus. A video clipping unit clips video in a range specified by a participant&#39;s terminal out of 360-degree video indicated by the source data supplied thereto. Upon receiving a range of video specified in association with the participant&#39;s terminal, a content distribution unit clips video in the range out of 360-degree video acquired by the source data acquisition unit and distributes the clipped video to the participant&#39;s terminal. Also, upon a display switching instruction being made in association with a switching source participant&#39;s terminal, the content distribution unit distributes video that is clipped according to a range of video specified in association with a switching destination participant&#39;s terminal, to the switching source participant&#39;s terminal.

TECHNICAL FIELD

The present invention relates to video distribution technology.

BACKGROUND

Japanese Patent Application No. JP2017-504863A discloses technology forintegrating and presenting visual data and non-visual data acquired byan unmanned aerial vehicle, using a virtual reality terminal.

Video distributed using AR (Augmented Reality) technology, VR (VirtualReality) technology, or the like provides an enhanced sense of beingthere to a viewer, compared to normal video. In addition, if a viewercan view video that is the same as video that is viewed by anotherviewer, these viewers can share an enhanced sense of unity, whichfacilitates human communication in the real world as well.

Therefore, an object of the present invention is to make it easier toswitch to video that is viewed by another user.

SUMMARY OF THE INVENTION

To achieve the above-described object, the present invention provides aninformation processing apparatus including: an image acquisition unitthat acquires a captured image; and a distribution unit that, uponreceiving a range of image specified in association with a first displayapparatus, clips a partial image in the range from the acquired imageand distributes the clipped partial image to the first displayapparatus, and upon a display switching instruction being made inassociation with the first display apparatus, distributes anotherpartial image clipped in a range specified in association with a seconddisplay apparatus, to the first display apparatus.

The information processing apparatus may further include an accumulationunit that accumulates the acquired image, and the distribution unit maydistribute the other partial image clipped in the past according to therange specified in association with the second display apparatus, fromthe accumulated image, to the first display apparatus.

The information processing apparatus may further include: a soundacquisition unit that acquires sound that reaches the second displayapparatus; and a first transmission unit that transmits data thatindicates a direction corresponding to the second display apparatus, andthe acquired sound, to the first display apparatus, and the firstdisplay apparatus may include a speaker that outputs sound indicated bythe data transmitted by the first transmission unit, as sound that comesfrom the direction indicated by the data, and upon an instruction beingmade to change an orientation of the first display apparatus to thedirection indicated by the data transmitted to the first displayapparatus, the distribution unit may distribute the other partial imageclipped according to the range specified in association with the seconddisplay apparatus, to the first display apparatus.

The information processing apparatus may further include a secondtransmission unit that transmits a direction instructing image thatindicates a direction corresponding to the second display apparatus, tothe first display apparatus, and when an instruction is made to changean orientation of the first display apparatus to the direction indicatedby the direction instructing image thus transmitted, the distributionunit may distribute the other partial image clipped according to therange specified in association with the second display apparatus, to thefirst display apparatus.

The information processing apparatus may further include a positionacquisition unit that acquires position information indicating at leastone position of one or more display apparatuses, and the distributionunit may determine another display apparatus from among the one or moredisplay apparatuses, from which the position information indicating aposition that is at a distance within a predetermined range to the firstdisplay apparatus has been acquired, as the second display apparatus todistribute the other partial image.

The information processing apparatus may further include a languageacquisition unit that acquires language information indicating at leastone language that is used in one or more display apparatuses, and thedistribution unit may determine another display apparatus from among theone or more display apparatuses, from which the language informationindicating a language that is the same as a language used in the firstdisplay apparatus has been acquired, as the second display apparatus todistribute the other partial image.

The information processing apparatus may further include a time zoneacquisition unit that acquires time zone information indicating at leastone standard time zone used at a position of one or more displayapparatuses, and the distribution unit may determine another displayapparatus from among the one or more display apparatuses, of which astandard time zone indicated by the time zone information acquiredtherefrom has a time difference smaller than a threshold value whencompared to another standard time zone indicated by the time zoneinformation acquired from the first display apparatus, as the seconddisplay apparatus to distribute the other partial image.

The information processing apparatus may further include a plurality ofcapturing apparatuses that capture images, and the distribution unit maydetermine another display apparatus that is displaying an image capturedby a capturing apparatus that is the same as a capturing apparatus thathas captured an image displayed by the first display apparatus, as thesecond display apparatus to distribute the other partial image.

The information processing apparatus may further include: a plurality ofcapturing apparatuses that capture images; and a capture positionacquisition unit that acquires capture position information indicatingpositions of the plurality of capturing apparatuses, and when theplurality of capturing apparatuses include another capturing apparatusfrom which the capture position information indicating a position at adistance within a predetermined range to a capturing apparatus that hascaptured image displayed by the first display apparatus has beenacquired, the distribution unit may determine another display apparatusthat is displaying the image captured by the other capturing apparatusas the second display apparatus to distribute the other partial image.

According to the present invention, it is possible to make it easier toswitch to video that is being viewed by another user.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram illustrating an overall configuration of a virtualtravel system according to an embodiment.

FIG. 2 is a diagram illustrating a hardware configuration of a proxyuser's terminal.

FIG. 3 is a diagram illustrating a hardware configuration of aparticipant's terminal.

FIG. 4 is a diagram illustrating a hardware configuration of adistribution server apparatus.

FIG. 5 is a diagram illustrating a functional configuration realized bya virtual travel system.

FIG. 6 is a diagram illustrating an example of a specified range.

FIG. 7A is a diagram illustrating an example of a switching instruction.

FIG. 7B is a diagram illustrating an example of a switching instruction.

FIG. 7C is a diagram illustrating an example of a switching instruction.

FIG. 8 is a diagram illustrating an example of operation sequencescarried out by apparatuses in a provision process.

FIG. 9 is a diagram illustrating a functional configuration realized bya distribution server apparatus according to a variation.

FIG. 10 is a diagram illustrating an example of video output accordingto a variation.

FIG. 11 is a diagram illustrating a functional configuration realized bya virtual travel system according to a variation.

FIG. 12 is a diagram illustrating an overall configuration of a virtualtravel system according to a variation.

FIG. 13 is a diagram illustrating a functional configuration realized bya proxy user's terminal according to the variation.

FIG. 14 is a diagram illustrating an example of a positionalrelationship between proxies.

FIG. 15 is a diagram illustrating an example of generated video.

FIG. 16 is a diagram illustrating a functional configuration realized bya participant's terminal according to a variation.

DETAILED DESCRIPTION

FIG. 1 illustrates an overall configuration of virtual travel system 1according to an embodiment. Virtual travel system 1 is a system forproviding a virtual travel service through which users can virtuallyenjoy travel without actually going to a travel destination such as atourist spot. Virtual travel system 1 is mainly used by two types ofusers, namely a proxy user who captures video at a travel destination,and a participant who takes part in virtual travel.

Virtual travel system 1 includes network 2, proxy user's terminal 10,participant's terminals 20-1, 20-2, 20-3, and so on (hereinafterreferred to as “participant's terminals 20” when they are notdistinguished from each other), and distribution server apparatus 30.Network 2 is a communication system including a mobile communicationnetwork, the Internet, and the like, and relays the exchange of databetween apparatuses accessing that system. Network 2 is accessed bydistribution server apparatus 30 through wired communication, and byproxy user's terminal 10 and participant's terminals 20 through wirelesscommunication. Network 2 may be accessed through wired communication orwireless communication.

Proxy user's terminal 10 is an image capturing apparatus that is carriedby a proxy user at a travel destination, and captures video at thetravel destination. In the present embodiment, proxy user's terminal 10is attached to the head of a proxy user, and captures so-called360-degree video (also called all-direction image or omnidirectionalimage), which is moving images including images on the front, rear,left, right, top, and bottom side of the proxy user. Proxy user'sterminal 10 also records sound using a microphone thereof, and transmitsit to distribution server apparatus 30 along with captured 360-degreevideo.

Distribution server apparatus 30 is a distribution apparatus thatdistributes, to participant's terminals 20, virtual travel contentincluding video, sound, and so on captured or recorded by proxy user'sterminal 10 at a travel destination. The distribution server apparatus30 clips and distributes a certain range of video specified fromparticipant's terminal 20 out of captured 360-degree video.Participant's terminals 20 are output apparatuses that output contentdistributed by distribution server apparatus 30, and, for example,display video included in the content, and emit sound included in thecontent.

In the present embodiment, participant's terminal 20 is an HMD (HeadMounted Display) attached to the head of a user. Participant's terminal20 measures the direction in which the front face of the terminal isorientated (the azimuth and the angles of elevation and depression), andspecifies a range of video corresponding to the measured direction fordistribution server apparatus 30. As a result, for example, if aparticipant faces east, video captured at the position of proxy user'sterminal 10 in a direction to the east is distributed, and if aparticipant faces upward, video captured by the proxy user's terminal 10in a vertically upward direction is distributed.

FIG. 2 is a diagram illustrating a hardware configuration of proxyuser's terminal 10. Proxy user's terminal 10 is a computer that includesapparatuses such as processor 11, memory 12, storage 13, communicationunit 14, input unit 15, output unit 16, 360-degree camera unit 17,sensor unit 18, and bus 19. The term “apparatus” used here can bereplaced with “circuit”, “device”, “unit”, or the like. One or more ofeach apparatus may be included, and some apparatuses may be omitted.

Processor 11 controls the computer as a whole by running an operatingsystem, for example. Processor 11 may be constituted by a centralprocessing unit (CPU) including an interface with peripheralapparatuses, a control apparatus, a computation apparatus, registers,and the like. Additionally, processor 11 reads out programs (programcode), software modules, data, and the like from storage 13 and/orcommunication unit 14 into memory 12, and then executes various types ofprocesses in accordance therewith.

There may be one, or two or more, processors 11 that execute the varioustypes of processes, and two or more processors 11 may execute varioustypes of processes simultaneously or sequentially. Processor 11 may beprovided as one or more chips. The programs may be transmitted from anetwork over an electrical communication line.

Memory 12 is a computer-readable recording medium, and may beconstituted by at least one of ROM (Read Only Memory), EPROM (ErasableProgrammable ROM), EEPROM (Electrically Erasable Programmable ROM), RAM(Random Access Memory), and so on, for example. Memory 12 may be calleda “register”, “cache”, “main memory” (a main storage apparatus), or thelike. Memory 12 can store the aforementioned programs (program code),software modules, data, and the like.

Storage 13 is a computer-readable recording medium, and may beconstituted by at least one of an optical disk such as a CD-ROM (CompactDisc ROM), a hard disk drive, a flexible disk, a magneto-optical disk(e.g., a compact disk, a digital versatile disk, or a Blu-ray(registered trademark) disk), a smartcard, flash memory (e.g., a card, astick, or a key drive), a Floppy (registered trademark) disk, a magneticstrip, and the like.

Storage 13 may be called an auxiliary storage apparatus. Theaforementioned storage medium may be a database, a server, or anotherappropriate medium including memory 12 and/or storage 13, for example.Communication unit 14 is hardware for communicating between computersover a wired and/or wireless network (a transmission/reception device),and is also called a network device, a network controller, a networkcard, a communication module, and the like, for example.

Input unit 15 is an input device that accepts inputs from the exterior(e.g., a keyboard, a mouse, a microphone, a switch, a button, a sensor,or the like). In addition, input unit 15 includes, as an input device, amicrophone that collects sound (ambient sound) generated in the vicinityof the terminal and reaching the terminal. Output unit 16 is an outputdevice that makes outputs to the exterior (e.g., a display, a speaker,an LED lamp, or the like).

360-degree camera unit 17 is an image capturing device that captures theabove-described 360-degree video. 360-degree camera unit 17 includes twoor more wide angle cameras, for example, and captures 360-degree videoby orientating the wide angle cameras in different directions so thatall directions are within the coverage of image capturing (note that itis not necessary to cover all directions without exception, and theremay be a blind spot at the proxy user's feet). Sensor unit 18 includes asensor that measures the orientation of 360-degree camera unit 17 (amagnetic sensor or the like).

The 360-degree camera unit 17 has a front face determined in advance,and sensor unit 18 measures the azimuth of the direction in which thefront face is orientated (the forward direction of the camera unit) andthe angles of elevation and depression of the front face. If the proxyuser moves or turns, the forward direction of the camera unit changes,and sensor unit 18 outputs measurement values indicating theorientation. These measurement values indicate, for example, which partof the captured 360-degree video is video captured in a direction to theeast from the terminal, which part is video captured upward in avertical direction from the terminal, and so on.

Sensor unit 18 may be built into 360-degree camera unit 17. Themicrophone of input unit 15 may also be built into 360-degree cameraunit 17. The apparatuses such as processor 11 and memory 12 can accesseach other over bus 19, which is used for communicating information. Bus19 may be constituted by a single bus, or may be constituted by busesthat differ among the apparatuses.

Proxy user's terminal 10 and so on may be configured including hardwaresuch as a microprocessor, a DSP (Digital Signal Processor), an ASIC(Application Specific Integrated Circuit), a PLD (Programmable LogicDevice), an FPGA (Field Programmable Gate Arrays, and the like, and someor all of the function blocks may be realized by that hardware. Forexample, processor 11 may be provided as at least one of these types ofhardware.

FIG. 3 is a diagram illustrating a hardware configuration ofparticipant's terminal 20. Participant's terminal 20 is a computer thatincludes apparatuses such as processor 21, memory 22, storage 23,communication unit 24, input unit 25, output unit 26, sensor unit 27,and bus 28. These apparatuses are of the same types as the apparatusesof the same names in FIG. 2 even though capabilities, specifications, orthe like are different. However, they are not completely the same, andthe following describes particularly important points that differ fromthe apparatuses in FIG. 2.

Output unit 26 includes a speaker that allows a user to discern thedirection from which sound comes (so-called sound localization). Aparticipant wearing participant's terminal 20 can discern the directionfrom which sound comes, e.g. the participant can hear anotherparticipant's voice from behind him/her. Sensor unit 27 includes asensor (a magnetic sensor or the like) that measures the orientation ofthe front face of the terminal, and measures the azimuth of thedirection in which the front face is orientated and the angles ofelevation and depression of the front face.

FIG. 4 is a diagram illustrating a hardware configuration ofdistribution server apparatus 30. Distribution server apparatus 30 is acomputer that includes apparatuses such as processor 31, memory 32,storage 33, communication unit 34, and bus 35. These apparatuses are ofthe same types as the apparatuses of the same names in FIG. 2 eventhough capabilities, specifications, or the like are different.

Proxy user's terminal 10, participant's terminals 20, and distributionserver apparatus 30 included in virtual travel system 1 store programsprovided by the system, and implement the following group of functionsby the processors included in the apparatuses executing programs andcontrolling the various units.

FIG. 5 illustrates a functional configuration realized by virtual travelsystem 1. Proxy user's terminal 10 includes video capturing unit 101,ambient sound recording unit 102, camera orientation measurement unit103, and capture data transmission unit 104.

Participant's terminal 20 includes voice recording unit 201, terminalorientation measurement unit 202, specified range determination unit203, and content output unit 204. Although FIG. 5 illustrates only oneparticipant's terminal 20, every one of a plurality of participant'sterminals 20 realizes the functional configuration illustrated in FIG.5. Distribution server apparatus 30 includes source data acquisitionunit 301, video clipping unit 302, specified range acquisition unit 303,content generation unit 304, content data transmission unit 305, andswitching instruction acceptance unit 306.

In virtual travel system 1, distribution server apparatus 30 storesterminal IDs (Identifications) respectively identifying participant'sterminals 20, and pieces of address information (IP (Internet Protocol)addresses or the like) used when video is distributed to thoseparticipant's terminals 20, in association with each other. Also, proxyuser's terminal 10 and participant's terminals 20 store addressinformation regarding distribution server apparatus 30, and can exchangedata with distribution server apparatus 30 when necessary.

Video capturing unit 101 of proxy user's terminal 10 captures video ofthe proxy user's surroundings. In the present embodiment, videocapturing unit 101 captures 360-degree video by controlling 360-degreecamera unit 17 illustrated in FIG. 2. During a virtual travel serviceproviding period (a period from a service start time to a service endtime that have been determined), video capturing unit 101 captures360-degree video while simultaneously supplying the captured 360-degreevideo to capture data transmission unit 104.

Ambient sound recording unit 102 controls the microphone provided ininput unit 15 illustrated in FIG. 2 to record the above-describedambient sound (sound generated in the vicinity of the terminal andreaching the terminal). During the virtual travel service providingperiod, ambient sound recording unit 102 records ambient sound whilesimultaneously supplying the recorded ambient sound to capture datatransmission unit 104.

Camera orientation measurement unit 103 controls sensor unit 18illustrated in FIG. 2 to measure the azimuth of the direction in whichthe predetermined front face of 360-degree camera unit 17 is orientatedand the angles of elevation and depression of the front face. Theazimuth and the angles of elevation and depression are informationindicating the orientation of the front face of 360-degree camera unit17, and are hereinafter referred to as “camera orientation information”.During the virtual travel service providing period, camera orientationmeasurement unit 103 repeatedly measures camera orientation information(the azimuth and the angles of elevation and depression) whilesimultaneously and repeatedly supplying the measured camera orientationinformation to capture data transmission unit 104.

Capture data transmission unit 104 transmits capture data, which is dataregarding video captured by the terminal, to distribution serverapparatus 30. Specifically, capture data transmission unit 104generates, as capture data, data that indicates 360-degree videosupplied from video capturing unit 101, ambient sound supplied fromambient sound recording unit 102, and camera orientation informationsupplied from camera orientation measurement unit 103, and transmits thecapture data to distribution server apparatus 30.

Voice recording unit 201 of participant's terminal 20 records the voiceof the participant wearing the terminal. Voice recording unit 201transmits voice data indicating the recorded voice to distributionserver apparatus 30. Source data acquisition unit 301 of distributionserver apparatus 30 acquires data (source data) from which content(virtual travel content) that is to be distributed by the apparatus isgenerated. Source data is constituted by capture data transmitted fromproxy user's terminal 10 and voice data transmitted from participant'sterminal 20.

Source data acquisition unit 301 acquires capture data transmitted fromproxy user's terminal 10 as the source data of content, therebyacquiring the video indicated by the capture data, i.e. video capturedby proxy user's terminal 10 (360-degree video in the presentembodiment). Source data acquisition unit 301 is an example of “an imageacquisition unit” according to the present invention. Source dataacquisition unit 301 also acquires capture data, thereby acquiring thesound indicated by the capture data, i.e. sound (ambient sound) reachingproxy user's terminal 10.

Each time source data acquisition unit 301 acquires capture data andvoice data as source data of content, source data acquisition unit 301supplies the acquired source data to video clipping unit 302. Videoclipping unit 302 clips video in a range specified by participant'sterminal 20 out of 360-degree video indicated by the supplied sourcedata. A range is specified in the following manner. Terminal orientationmeasurement unit 202 of participant's terminal 20 measures the azimuthof the direction in which the predetermined front face of the terminalis orientated and the angles of elevation and depression of the frontface as information indicating the orientation of the terminal(hereinafter referred to as “terminal orientation information”).

Terminal orientation measurement unit 202 repeatedly measures terminalorientation information at predetermined time intervals, and repeatedlysupplies the measured terminal orientation information to specifiedrange determination unit 203. Specified range determination unit 203determines a specified range that is to be specified as a range of videothat is to be displayed on the terminal, based on the terminalorientation information measured by terminal orientation measurementunit 202. For example, specified range determination unit 203determines, as a specified range, a range of video captured in adirection indicated by the supplied terminal orientation information, bya camera that has a certain angle of view (e.g. a vertical angle of 25degrees and a horizontal angle of 45 degrees).

FIG. 6 illustrates an example of a specified range. FIG. 6 illustrates360-degree video A1 captured by video capturing unit 101. 360-degreevideo A1 is video captured by 360-degree camera unit 17 in a state where360-degree camera unit 17 is orientated in the true north and the angleof tilt is 0 degrees (in a state where the forward direction of thecamera is horizontal). Therefore, in 360-degree video A1, the horizon islocated at the middle of the video (at 0 degrees) in the verticaldirection, and the true north direction is located at the middle of thevideo in the horizontal direction. The upper video represents video ofobjects that are located vertically upward of 360-degree camera unit 17(90 degrees), and the lower video represents video of objects that arelocated vertically downward of 360-degree camera unit 17 (−90 degrees).

For example, if specified range determination unit 203 determines therange of captured video corresponding to the north direction and theelevation angle of 20 degrees as a specified range, it means thatspecified range C1 centered around virtual sight direction B1 (thedirection toward which the participant virtually looks), which indicatesa position corresponding to the north direction and the elevation angleof 20 degrees in 360-degree video A1, is determined. Also, if specifiedrange determination unit 203 determines the range of captured videocorresponding to the southeast direction and the depression angle of 45degrees as a specified range, it means that specified range C2 centeredaround virtual sight direction B2, which indicates a positioncorresponding to the southeast direction and the depression angle of 45degrees in 360-degree video A1, is determined.

Specified range determination unit 203 generates information thatindicates the azimuth and the angles of elevation and depression, whichindicate the virtual sight direction, the angle of view, and theterminal ID of the terminal, as range information indicating thedetermined specified range, and transmits the range information todistribution server apparatus 30. Specified range acquisition unit 303of distribution server apparatus 30 receives the transmitted rangeinformation, thereby acquiring the range of video specified byparticipant's terminal 20 identified by the terminal ID indicated by therange information, i.e. the specified range. Specified range acquisitionunit 303 supplies range information indicating the acquired specifiedrange to video clipping unit 302 and switching instruction acceptanceunit 306.

Upon being supplied with the source data of content from source dataacquisition unit 301, video clipping unit 302 clips video in thespecified range indicated by the range information supplied fromspecified range acquisition unit 303 (in the example in FIG. 6, video inspecified range C1 or video in specified range C2) out of the 360-degreevideo indicated by the supplied source data. Note that, if the forwarddirection of 360-degree camera unit 17 is tilted relative to thehorizontal direction, 360-degree video A1 illustrated in FIG. 6 will bedisplaced in the vertical direction (the horizon will be displaced inthe vertical direction).

Video clipping unit 302 corrects the tilt based on the orientation ofthe front face indicated by camera orientation information included insource data to acquire video like 360-degree video A1, and thereafterclips video in the specified range, for example. Well-known technologyused in VR (Virtual Reality) or AR (Augmented Reality) may be used forthis correction. Note that, instead of correcting the 360-degree video,video clipping unit 302 may calculate positions corresponding to theazimuth and the angles of elevation and depression in the tilted360-degree video to clip video in the specified range.

Video clipping unit 302 supplies video thus clipped to contentgeneration unit 304 together with the range information and the sourcedata. Content generation unit 304 generates content indicating thesupplied video, i.e. the video clipped in the range specified byparticipant's terminal 20, and ambient sound indicated by the suppliedsource data. Content generation unit 304 generates content dataindicating the generated content and the terminal ID indicated by thesupplied range information.

Also, content generation unit 304 adds, to the generated content data,the participant's voice indicated by the supplied source data and thedirection corresponding to participant's terminal 20 used by theparticipant. Specifically, content generation unit 304 adds, to thecontent data, the direction indicated by the range of video specifiedfrom participant's terminal 20 (in the example in FIG. 6, virtual sightdirection B1 (the north direction) or virtual sight direction B2 (thesoutheast direction)) as the direction corresponding to thatparticipant's terminal 20.

Content generation unit 304 supplies content data thus generated tocontent data transmission unit 305. Content data transmission unit 305transmits the supplied content data to participant's terminal 20identified by the terminal ID indicated by the content data. Contentoutput unit 204 of participant's terminal 20 outputs content indicatedby the content data transmitted from distribution server apparatus 30.

Specifically, content output unit 204 outputs video indicated by thecontent data to the display of output unit 26 to display the video, andoutputs ambient sound indicated by the content data to the speaker ofoutput unit 26 to emit the sound. Content thus output is the video andambient sound captured and recorded by the proxy user at the traveldestination, and is content that provides the participant with virtualtravel experience. The participant wearing participant's terminal 20views content thus output.

To view content means to watch displayed video content and listen toemitted sound content. Note that to watch content that includes videobut does not include sound is also referred to as “to view content”. Asdescribed above, in the present embodiment, participant's terminal 20 isa display apparatus that displays 360-degree video (specifically, videoclipped out of 360-degree video) acquired by source data acquisitionunit 301.

Range information is transmitted from that participant's terminal 20,and thus a range of video is specified in association with thatparticipant's terminal 20. Video clipping unit 302, specified rangeacquisition unit 303, content generation unit 304, and content datatransmission unit 305 of distribution server apparatus 30 function ascontent distribution unit 307 that, upon receiving the range of video,clips the video in the range out of the 360-degree video acquired bysource data acquisition unit 301, and distributes it to thatparticipant's terminal 20. Content distribution unit 307 is an exampleof “a distribution unit” according to the present invention.

In the present embodiment, each participant's terminal 20 is providedwith a speaker that outputs sound indicated by content data transmittedby content data transmission unit 305, as sound that comes from thedirection indicated by the content data. Source data acquisition unit301 of distribution server apparatus 30 acquires voice data transmittedfrom participant's terminal 20, thereby acquiring sound reaching thatparticipant's terminal 20 (the participant's voice in the presentembodiment). In this case, source data acquisition unit 301 is anexample of “a sound acquisition unit” according to the presentinvention.

Content data transmission unit 305 of distribution server apparatus 30transmits content data indicating sound acquired from participant'sterminal 20 (the participant's voice) and the direction corresponding tothat participant's terminal 20 (the direction indicated by the range ofvideo specified by that participant's terminal 20) to anotherparticipant's terminal 20. In this case, “another participant's terminal20” is an example of “a first display apparatus” according to thepresent invention, and “participant's terminal 20” from which “sound” isacquired is an example of “a second display apparatus” according to thepresent invention. Content data transmission unit 305 is an example of“a first transmission unit” according to the present invention.

Content output unit 204 of participant's terminal 20 controls theabove-described speaker to output the participant's voice indicated bythe transmitted content data as sound that comes from the directionindicated by the content data. For example, if a range of video in theeast is specified by participant's terminal 20-2, content output unit204 of participant's terminal 20-1 outputs the voice of the participantusing the participant's terminal 20-2 such that the voice sounds as ifit comes from the east.

In the present embodiment, switching instruction acceptance unit 306 ofdistribution server apparatus 30 uses the direction from which the voicecomes from to accept an instruction (a switching instruction) to switchfrom video displayed on participant's terminal 20 to video displayed onanother participant's terminal 20. Therefore, in the present embodiment,terminal orientation measurement unit 202 of participant's terminal 20repeatedly transmits terminal orientation information that has beenmeasured to distribution server apparatus 30 as well.

If terminal orientation information thus transmitted indicates that theorientation of the transmission source participant's terminal 20 is tobe changed to the direction from which a voice recorded by anotherparticipant's terminal 20 comes from, switching instruction acceptanceunit 306 accepts the terminal orientation information as a switchinginstruction that instructs to switch from the video displayed on thetransmission source participant's terminal 20 to the video clipped inthe range specified by participant's terminal 20 that has recorded thevoice.

FIGS. 7A to 7C illustrate examples of switching instructions. In FIGS.7A to 7C, participant's terminals 20-1, 20-2, and 20-3 are used byparticipants A, B, and C. FIG. 7A illustrates that the forward directionof participant A (which is assumed to be the same as the forwarddirection of participant's terminal 20-1) is the north direction, thevoice of participant B comes from the southeast, and the voice ofparticipant C comes from the southwest.

FIG. 7B illustrates that participant A has changed the forward directionof participant's terminal 20-1 worn by participant A to the directionfrom which the voice of participant B comes from (the southeast). Inthis case, terminal orientation information indicating the southeast istransmitted from participant's terminal 20-1, and therefore switchinginstruction acceptance unit 306 accepts the terminal orientationinformation as a switching instruction that instructs to switch from thevideo displayed on participant's terminal 20-1 to the video clipped inthe range specified by participant's terminal 20-2 (the video clippedaccording to the range specified in association with participant'sterminal 20-2).

FIG. 7C illustrates that participant A has changed the forward directionof participant's terminal 20-1 worn by participant A to the directionfrom which the voice of participant C comes from (the southwest). Inthis case, terminal orientation information indicating the southwest istransmitted from participant's terminal 20-1, and therefore switchinginstruction acceptance unit 306 accepts the terminal orientationinformation as a switching instruction that instructs to switch from thevideo displayed on participant's terminal 20-1 to the video clipped inthe range specified by participant's terminal 20-3 (the video clippedaccording to the range specified in association with participant'sterminal 20-3).

Upon accepting the switching instruction, switching instructionacceptance unit 306 supplies content data transmission unit 305 with theterminal ID of participant's terminal 20 that has made the switchinginstruction (the switching source terminal ID of switching sourceparticipant's terminal 20) and the terminal ID of participant's terminal20 that specifies the range of switching destination video (theswitching destination terminal ID of switching destination participant'sterminal 20). Content data transmission unit 305 switches the contentdata that is to be transmitted to participant's terminal 20 indicated bythe supplied switching source terminal ID to content data that istransmitted to participant's terminal 20 indicated by the suppliedswitching destination terminal ID, and transmits it.

For example, if the switching instruction illustrated in FIG. 7B ismade, content data transmission unit 305 switches the content data thatis to be transmitted to participant's terminal 20-1 of participant A tocontent data that is transmitted to participant's terminal 20-2 ofparticipant B, and transmits it. As a result, by turning to thedirection from which the voice of participant B comes, participant A canview the video that is being viewed by participant B.

As described above, in the present embodiment, if an instruction tochange the orientation of switching source participant's terminal 20 tothe direction indicated by content data transmitted to thatparticipant's terminal 20 is made, the above-described contentdistribution unit 307 (video clipping unit 302, specified rangeacquisition unit 303, content generation unit 304, and content datatransmission unit 305) distributes video that is clipped according to arange of video specified in association with switching destinationparticipant's terminal 20, to switching source participant's terminal20.

Switching source participant's terminal 20 is an example of “a firstdisplay apparatus” according to the present invention, and switchingdestination participant's terminal 20 is an example of “a second displayapparatus” according to the present invention. In this way, if a displayswitching instruction is made in association with switching sourceparticipant's terminal 20, content distribution unit 307 distributesvideo that is clipped according to a range of video specified inassociation with switching destination participant's terminal 20, toswitching source participant's terminal 20.

Once content distribution unit 307 has switched content, contentdistribution unit 307 continues distributing the video clipped accordingto the specification by switching destination participant's terminal 20until an instruction (a return instruction) to switch (return) videodisplayed on participant's terminal 20, which is video displayed onanother participant's terminal 20, to the original video is accepted.Switching instruction acceptance unit 306 accepts this returninstruction. For example, if terminal orientation information thatindicates a gesture for returning the video to the original (a returninggesture) is transmitted, switching instruction acceptance unit 306accepts the transmitted terminal orientation information as a returninstruction.

A return gesture is, for example, a gesture made by slightly shakingone's head. In this case, if the azimuth indicated by the terminalorientation information changes twice in a predetermined period, forexample, switching instruction acceptance unit 306 determines that agesture has been made, and accepts a return instruction. Note that areturn gesture may be a gesture made by nodding one's head up and down,moving one's face such that the forward direction of the face moves in acircle, or the like. Also, the user may be allowed to set a movement ofhead as a return gesture.

Also, a gesture may be used as an operation performed to switch to videodisplayed on another participant's terminal 20, in addition to as anoperation performed to return video to the original video. In such acase, any of the above-described plurality of gestures may be used asthe switching operation. Also, switching instruction acceptance unit 306may accept a return instruction not only when a return gesture is made,but also when a predetermined operation (e.g. an operation to touch apredetermined controller) is performed on input unit 25 of participant'sterminal 20, for example.

Upon switching instruction acceptance unit 306 accepting a returninstruction, content data transmission unit 305 switches content datathat is to be transmitted to participant's terminal 20 by which a returninstruction has been made, to content data indicating the terminal ID ofthat participant's terminal 20, and transmits it. Thus, the participantcan return the display from video that is viewed by another participantto video that is in the range specified by the participant.

Based on the above-described configuration, the apparatuses included invirtual travel system 1 perform a provision process through whichvirtual travel content is provided to a participant.

FIG. 8 is illustrates an example of operation sequences carried out byapparatuses in a provision process. These operation sequences arestarted at the start time of the virtual travel service, for example.First, proxy user's terminal 10 (video capturing unit 101, ambient soundrecording unit 102, and camera orientation measurement unit 103)captures 360-degree video around the proxy user, records ambient sound,and measures camera orientation information (step S11).

Next, proxy user's terminal 10 (capture data transmission unit 104)transmits capture data acquired as a result of the operations in stepS11, to distribution server apparatus 30 (step S12). In parallel withsteps S11 and S12, participant's terminal 20 (voice recording unit 201)records the voice of the participant wearing the terminal (step S13),and transmits voice data indicating the recorded voice to distributionserver apparatus 30 (step S14).

Distribution server apparatus 30 (source data acquisition unit 301)acquires the capture data and the voice data transmitted in steps S12and S14, as source data of content that is to be distributed by thatapparatus (step S15). Next, participant's terminal 20 (terminalorientation measurement unit 202) measures information indicating theorientation of the terminal (terminal orientation information) (stepS21). Subsequently, participant's terminal 20 (specified rangedetermination unit 203) determines a specified range of video that is tobe displayed on the terminal, based on the measured terminal orientationinformation (step S22), and transmits range information indicating thedetermined specified range to distribution server apparatus 30 (stepS23).

Distribution server apparatus 30 (specified range acquisition unit 303)acquire the specified range by receiving the transmitted rangeinformation (step S24). Next, distribution server apparatus 30 (videoclipping unit 302) clips video in the acquired specified range out of360-degree video indicated by the source data acquired in step S15 (stepS25). Subsequently, distribution server apparatus 30 (content generationunit 304) generates content data indicating the clipped video, therecorded ambient sound, the recorded voice of the participant, and thedirection corresponding to participant's terminal 20 by which the voicewas recorded (step S26).

Thereafter, distribution server apparatus 30 (content data transmissionunit 305) transmits the generated content data to participant's terminal20 that has specified a range of video (step S27). Participant'sterminal 20 (content output unit 204) outputs content indicated by thetransmitted content data (step S28). At this time, participant'sterminal 20 (content output unit 204) outputs the participant's voiceindicated by the content data as sound that comes from the directionindicated by the content data.

Here, it is assumed that participant's terminal 20 (terminal orientationmeasurement unit 202) measures terminal orientation information when thefront face of the terminal is orientated in the direction from which theparticipant’ voice comes from (step S31). Participant's terminal 20(terminal orientation measurement unit 202) transmits the measuredterminal orientation information to distribution server apparatus 30(step S32). The transmitted terminal orientation information indicatesthat the orientation of participant's terminal 20 at the transmissionsource is to be changed to the direction from which the recorded voiceof another participant's terminal 20 comes, and therefore distributionserver apparatus 30 (switching instruction acceptance unit 306) acceptsthe transmitted terminal orientation information as a switchinginstruction (step S33).

Next, distribution server apparatus 30 (content data transmission unit305) switches content data that is to be transmitted to participant'sterminal 20 at the switching source to content data that is to betransmitted to participant's terminal 20 at the switching destination(step S34), and transmits the content data to participant's terminal 20at the switching source (step S35). Participant's terminal 20 (contentoutput unit 204) at the switching source outputs content indicated bythe transmitted content data (step S36).

After step S36, upon measuring terminal orientation informationindicating the above-described return gesture (step S41), participant'sterminal 20 (terminal orientation measurement unit 202) transmits themeasured terminal orientation information to distribution serverapparatus 30 (step S42). Because the transmitted terminal orientationinformation indicates a return gesture, distribution server apparatus 30(switching instruction acceptance unit 306) accepts the transmittedterminal orientation information as a return instruction (step S43).

Next, distribution server apparatus 30 (content data transmission unit305) switches content data that is to be transmitted to participant'sterminal 20 at which the return instruction was made to content datathat indicates video for which a range was specified by thatparticipant's terminal 20 (step S44), and transmits the content data tothat participant's terminal 20 (step S45). Participant's terminal 20(content output unit 204) outputs content indicated by the transmittedcontent data (step S46).

In the present embodiment, each user (participant) specifies a range ofvideo that he/she wishes to view, by changing the direction in which thefront face of participant's terminal 20 worn by him/her is orientated.On the other hand, upon the front face of participant's terminal 20being orientated in the direction from which the voice of another usercomes, video in the range specified by the user is displayed. As aresult, it is easier to switch to video that is viewed by another usercompared to a case in which the user asks the other user about the rangeof video that is viewed by the other user, for example.

Also, in the present embodiment, another user's voice comes from thedirection indicated by the range of video specified by participant'sterminal 20 of the other user, and video viewed by the participant isdisplayed when the user turns to the direction. As a result, byswitching video, the user can discern the direction in which the otherviewer viewing the switching destination video is orientated.

2. Variations

The above-described embodiment is merely an example for carrying out thepresent invention, and the following variations are possible as well.

2-1. To Display Content in the Past

For example, a participant hears a joyful voice of another participant,and views video that is the same as video viewed by the otherparticipant. However, a scene that has attracted the other participant'sattention (e.g. a puppet show performed on an automation clock) may havebeen finished when the participant see it. Therefore, content (video andsound) that was output to participant's terminal 20 of the otherparticipant in the past may be output.

FIG. 9 illustrates a functional configuration realized by distributionserver apparatus 30 a according to the present variation. Distributionserver apparatus 30 a includes content accumulation unit 308 in additionto the units illustrated in FIG. 5. In the present variation, contentgeneration unit 304 supplies generated content data to contentaccumulation unit 308 as well.

Content accumulation unit 308 stores supplied content data, and thusaccumulates content distributed to participant's terminal 20 with theterminal ID indicated by the content data. By accumulating content,content accumulation unit 308 also accumulates 360-degree video acquiredby source data acquisition unit 301. Content accumulation unit 308 is anexample of “an accumulation unit” according to the present invention.Content accumulation unit 308 also accumulates ambient sound and thevoice of the participant as well as 360-degree video.

In the present variation, for example, if terminal orientationinformation transmitted from participant's terminal 20 indicates thatthe orientation of the transmission source participant's terminal 20 isto be changed to the direction from which a voice recorded by anotherparticipant's terminal 20 comes from, switching instruction receptionunit 306 accepts the terminal orientation information as a switchinginstruction that instructs to switch from the video displayed on thetransmission source participant's terminal 20 to the clipped video inthe range that was specified in the past (e.g. at a past point in timethat is backward by a period of time) by participant's terminal 20 thathas recorded the voice.

Content distribution unit 307 distributes, to switching sourceparticipant's terminal 20, video that was clipped in the past accordingto a range of video specified in association with switching destinationparticipant's terminal 20 out of video accumulated in contentaccumulation unit 308. Also, content distribution unit 307 distributes,to switching source participant's terminal 20, ambient sound in the pastand the participant's voice in the past as well as the video clipped inthe past. Thus, even after a scene that has attracted anotherparticipant's attention has ended, a participant can enable his or herown participant's terminal 20 to output content that represents thescene.

Note that content distribution unit 307 may distribute content as of apast point in time that is backward by a period of time specified by theparticipant, instead of content as of a past point in time that isbackward by a predetermined period of time. The length of the period oftime in the backward direction may be specified using a gesturerepresented by terminal orientation information (e.g. the periodincreases in the backward direction as the number of times theparticipant shakes his or her head increases) as with theabove-described return instruction, or specified by performing apredetermined operation on input unit 25 of participant's terminal 20(e.g. the period increases in the backward direction as the number oftouches increases), for example.

Content distribution unit 307 may first distribute the content that iscurrently distributed to switching destination participant's terminal 20to switching source participant's terminal 20 as in the embodiment, anddistributes the past content when the length of the period of time inthe backward direction is specified. As a result, even if a participantcauses participant's terminal 20 to output the same content as anotherparticipant, but finds out that a scene that has attracted theparticipant's attention has ended, the participant can thereafter causehis or her own participant's terminal 20 to output content thatrepresents the scene.

In the example in FIG. 9, content accumulation unit 308 accumulatescontent that indicates clipped video. However, content accumulation unit308 may accumulate source data, for example. In this case, videoclipping unit 302 clips video in the range specified by switchingdestination participant's terminal 20 out of 360-degree video indicatedby the source data accumulated in content accumulation unit 308, andcontent generation unit 304 generates content that includes the clippedvideo.

Content distribution unit 307 distributes content thus generated toswitching source participant's terminal 20. In this case, it is notnecessary to accumulate content for all participant's terminals 20, andit is only necessary to store source data. Therefore, compared toaccumulate clipped content, it is possible to suppress an increase inthe required storage capacity when the number of participant's terminals20 increases.

2-2. Direction of Sound

In the Embodiment, a direction indicated by the range of video specifiedby participant's terminal 20 is used as the direction from which anotherparticipant's voice comes, i.e. the direction corresponding toparticipant's terminal 20. For example, when participant B is viewingvideo of objects in the southwest, the speaker of participant's terminal20-1 of participant A emits the voice of participant B as sound comingfrom the southwest. However, the present invention is not limited inthis way.

For example, if there is one other participant, sound may be emittedsuch that sound comes from one predetermined direction (e.g. fromdirectly above), and if there are a plurality of other participants,sound may be emitted such that sound comes from a plurality ofpredetermined directions (e.g. directions are determined to be at equalintervals such that the directions of sound can be easily distinguishedfrom each other). Even in this case, by turning to the direction fromwhich a voice comes, participant can view video that is the same asvideo viewed by another participant who uttered the voice.

2-3. Switching Instruction

In the embodiment, in order to make a switching instruction, a directioncorresponding to another participant is represented as a direction fromwhich sound comes. However, it may be represented using an image. Ifthis is the case, content generation unit 304 generates content thatindicates video that includes an image that indicates the directioncorresponding to switching destination participant's terminal 20 (adirection instructing image).

FIG. 10 illustrates an example of video output according to the presentvariation. FIG. 10 illustrates video D1 that is output on participant'sterminal 20 of participant A, which includes direction instructing imageE1 corresponding to participant B and direction instructing image E2corresponding to participant C. In this example, the upper half of videoD1 relative to the center in the vertical direction represents a forwarddirection viewed from participant A, the lower half represents a reardirection viewed from participant, the right half relative to the centerin the horizontal direction represents the right side viewed fromparticipant, and the left half represents the right side viewed fromparticipant A.

In the example in FIG. 10, it is assumed that the forward directioncoincides with the north direction. Direction instructing image E1 is anarrow directed to the right at the center in the vertical direction,which indicates that participant B faces to the right relative toparticipant A (i.e. to the east). Direction instructing image E2 is anarrow directed to the left on the lower side in the vertical direction,which indicates that participant C faces to the left and rearwardrelative to participant A (i.e. to the southwest).

Content data transmission unit 305 transmits content data indicatingvideo including a direction instructing image indicating the directioncorresponding to switching destination participant's terminal 20 asillustrated in FIG. 10, to switching source participant's terminal 20.In this case, content data transmission unit 305 is an example of “asecond transmission unit” according to the present invention. In thepresent variation, switching instruction acceptance unit 306 accepts aswitching instruction using the direction indicated by this directioninstructing image.

When the transmitted terminal orientation information indicates that theorientation of transmission source participant's terminal 20 is to bechanged to the direction indicated by the direction instructing image(the direction corresponding to another participant's terminal 20),switching instruction acceptance unit 306 accepts video displayed ontransmission source participant's terminal 20 as a switching instructionthat instructs to switch to video that is clipped according to the rangespecified by participant's terminal 20 corresponding to the direction.

In the example in FIG. 10, if terminal orientation information thatinstructs to change the orientation of the terminal to the east istransmitted from participant's terminal 20-1 of participant A, switchinginstruction acceptance unit 306 accepts the terminal orientationinformation as a switching instruction that instructs to switch thevideo displayed on participant's terminal 20-1 to video clipped in therange specified by participant's terminal 20-2 of participant Bcorresponding to the direction. If terminal orientation information thatinstructs to change the orientation of the terminal to the southwest istransmitted, switching instruction acceptance unit 306 accepts theterminal orientation information as a switching instruction thatinstructs to switch the video to video clipped in the range specified byparticipant's terminal 20-3 of participant C.

In the present variation, if an instruction is made to change theorientation of switching source participant's terminal 20 to thedirection indicated by the direction instructing image transmitted toswitching source participant's terminal 20 by content data transmissionunit 305, content distribution unit 307 distributes video clippedaccording to the video range specified in association with switchingdestination participant's terminal 20 to switching source participant'sterminal 20. In this case, the direction to which participant shouldface is indicated by only using an image. Therefore, it is possible tomake a switching instruction even in a noisy situation where it isdifficult to hear sound.

2-4. Narrowing Down Other Participants

When there are many participants, if the voices of all of theparticipants can be heard, it is difficult to discern each voice. Also,even in a case where direction instructing images are displayed as inthe example in FIG. 10, if the number of direction instructing imagesincreases, it becomes difficult to view travel destination video, whichis the main content. Therefore, other participants of which voices ordirection instructing images are to be included in content may benarrowed down.

FIG. 11 illustrates a functional configuration realized by virtualtravel system 1 b according to the present variation. Virtual travelsystem 1 b includes participant's terminal 20 b and distribution serverapparatus 30 b. Participant's terminal 20 b includes terminalinformation transmission unit 205 in addition to the units illustratedin FIG. 5, and distribution server apparatus 30 b includes terminalinformation acquisition unit 309 in addition to the units illustrated inFIG. 5. Terminal information transmission unit 205 generates informationregarding the terminal (terminal information) and transmits it todistribution server apparatus 30 b.

Sensor unit 27 of participant's terminal 20 b includes a positioningsensor such as a GPS (Global Positioning System). Terminal informationtransmission unit 205 generates position information indicating theposition of the terminal measured by the positioning sensor as terminalinformation, and transmits it to distribution server apparatus 30 b.Terminal information acquisition unit 309 of distribution serverapparatus 30 b acquires the transmitted terminal information, i.e.,position information indicating the position of participant's terminal20 b. In this case, terminal information acquisition unit 309 is anexample of “a position acquisition unit” according to the presentinvention.

Terminal information acquisition unit 309 supplies the acquired terminalinformation to content generation unit 304 and switching instructionacceptance unit 306. Content generation unit 304 generates, as contentthat is to be distributed to participant's terminal 20 b, content thatincludes a voice recorded by participant's terminal 20 b of whichposition information indicating a position that is at a distance withina predetermined range to the aforementioned participant's terminal 20 bhas been acquired as terminal information.

For example, it is assumed that the predetermined range is 50 m,participants A and B participate in a virtual travel service in the sameroom, and participant C participates in the virtual travel service in adifferent location that is away from the room by 50 m or more. If thisis the case, the distance between the positions indicated by positioninformation acquired from participant's terminals 20 b of participants Aand B is within the predetermined range, and the distance between thepositions indicated by position information acquired from participant'sterminals 20 b of participants A and C is out of the predeterminedrange.

In this case, content generation unit 304 generates, as content that isto be distributed to participant's terminal 20 b of participant A,content that includes the voice of participant B recorded byparticipant's terminal 20 b and does not include the voice ofparticipant C recorded by participant's terminal 20 b. As a result,participant A can only hear the voice of the participant (participant Bin this example) near participant A.

As described above, participants whose voices can be heard are narroweddown. Therefore, switching instruction acceptance unit 306 accepts aswitching instruction upon receiving terminal orientation informationindicating that the orientation of participant's terminal 20 b is to bechanged to the direction from which the voices of the participants thusnarrowed down come from. As described above, content distribution unit307 determines participant's terminal 20 b from which positioninformation indicating a position that is at a distance within thepredetermined range to switching source participant's terminal 20 b hasbeen acquired, as switching destination participant's terminal 20 b, anddistributes content (distributes content that is to be distributed toswitching destination participant's terminal 20 b to switching sourceparticipant's terminal 20 b as well).

In the present variation, as described above, content that is to bedistributed to a participant is switched only to content that is to bedistributed to other participants at a distance within a predeterminedrange from the participant. Therefore, a participant can shareexperience only with participants in the range. Also, compare to a casewhere voices of all participants can be heard, it is easier to discernthe voice of each participant. Also, compared to a case where thedirection instructing images of all of the other participants aredisplayed, it is easier to view video.

Note that participants may be narrowed down in another way. For example,since anyone can participate in a virtual travel service, there may be aparticipant who speaks in a different language. In such a case,participants may be narrowed down to those who speak in the samelanguage. In this case, terminal information transmission unit 205 ofparticipant's terminal 20 b generates, as terminal information, languageinformation indicating the languages that has been set to the terminal(e.g. the language that is displayed on the menu screen), and transmitsit to distribution server apparatus 30 b.

Terminal information acquisition unit 309 of distribution serverapparatus 30 b acquires the transmitted terminal information, i.e.,language information indicating the language used in participant'sterminal 20 b. In this case, terminal information acquisition unit 309is an example of “a language acquisition unit” according to the presentinvention. Content generation unit 304 generates, as content that is tobe distributed to participant's terminal 20 b, content that includes avoice recorded by participant's terminal 20 b of which languageinformation indicating the same language as the language used in theaforementioned participant's terminal 20 b has been acquired as terminalinformation.

As a result, a participant can hear the voices of participants who speakthe language that is the same as the language (to be exact, the languagethat has been set to his or her own participant's terminal 20 b) that isspoken by the participant (to be exact, a participant can hear thevoices of participants who have set to participant's terminals 20 b thesame language as the language set by the participant). Switchinginstruction acceptance unit 306 accepts a switching instruction whenterminal orientation information indicating that the orientation ofparticipant's terminal 20 b is to be changed to the direction from whichthe voices of the participants thus narrowed down come from.

As described above, content distribution unit 307 determinesparticipant's terminal 20 b of which the language information acquiredtherefrom indicates the same language as the language used in switchingsource participant's terminal 20 b, as switching destinationparticipant's terminal 20 b, and distributes content. Thus, content thatis to be distributed to a participant can be switched only to contentthat is to be distributed to other participants who speak the samelanguage as the participant. Also, by narrowing down the participants,it becomes easier to discern the voice of each participant, and itbecomes easier to view video in the same manner as in the case whereposition information is used.

Note that the method for acquiring language information is not limitedto that described above. For example, terminal information acquisitionunit 309 may acquire, as terminal information, language information thathas been registered in advance by a participant or the provider of avirtual travel service. Also, terminal information acquisition unit 309may acquire terminal information indicating an IP address that has beenassigned to participant's terminal 20 b. The location of participant'sterminal 20 b can be identified using the IP address. Therefore,terminal information acquisition unit 309 acquires the IP address aslanguage information indicating a standard language used in a specifiedlocation.

Participants may be narrowed down in another way. For example, since aparticipant in any region in the world can participate in a virtualtravel service, and there may be a participant who lives in a region ina different standard time zone. For example, daytime in a region whereone participant lives may be night time in a region where anotherparticipant lives. For example, in a case where a virtual travel serviceis provided for a long period and participants can participate in orwithdraw from the service in the middle, the times at which theparticipants participate in the service become more likely to bedifferent from each other as the difference between the standard timezones of the regions in which the participant live increases. Therefore,participants may be narrowed down to those who live in regions insimilar standard time zones.

In this case, terminal information transmission unit 205 ofparticipant's terminal 20 b generates, as terminal information, timezone information indicating the standard time zone that has been set tothe terminal, and transmits it to distribution server apparatus 30 b.Terminal information acquisition unit 309 of distribution serverapparatus 30 b acquires the transmitted terminal information, i.e., timezone information indicating the standard time zone used at the positionof participant's terminal 20 b. In this case, terminal informationacquisition unit 309 is an example of “a time zone acquisition unit”according to the present invention.

Note that time zone information is not limited to information thatindicates the standard time zone that has been set, and may beinformation indicating the position of the terminal (because thestandard time zone can be known from the position). Content generationunit 304 generates, as content that is to be distributed toparticipant's terminal 20 b, content that includes a voice recorded byparticipant's terminal 20 b of which the standard time zone indicated bythe time zone information acquired therefrom has a time difference thatis smaller than a threshold value when compared to the standard timezone indicated by the time zone information acquired from theaforementioned participant's terminal 20 b.

For example, in a case of content that is to be distributed toparticipant's terminal 20 b of participant A, if the time differencebetween standard time zones indicated by time zone information acquiredfor participant's terminal 20 b of participant A and time zoneinformation acquired for participant's terminal 20 b of participant B issmaller than the threshold value, content generation unit 304 generatescontent that includes the voice recorded by participant's terminal 20 bof participant B.

As a result, a participant can only hear the voices of participants inthe region in the same standard time zone as the participant. Switchinginstruction acceptance unit 306 accepts a switching instruction whenterminal orientation information indicating that the orientation ofparticipant's terminal 20 b is to be changed to the direction from whichthe voices of the participants thus narrowed down according to thestandard time zones of the regions where the participants are presentcome from.

As described above, content distribution unit 307 determinesparticipant's terminal 20 b of which the time zone information acquiredtherefrom indicates a standard time zone that has a time differencesmaller than the threshold value when compared to the standard time zoneof the time zone information acquired from switching sourceparticipant's terminal 20 b, as switching destination participant'sterminal 20 b, and distributes content. Thus, content that is to bedistributed to a participant can be switched only to content that is tobe distributed to other participants in regions in standard time zonesthat are similar to the standard time zone of the participant. Also, bynarrowing down the participants, it becomes easier to discern the voiceof each participant, and it becomes easier to view video in the samemanner as in the case where position information is used.

2-5. Participation in a Group

It may be possible to participate in a virtual travel service in agroup. In such a case, for example, any of the users who belong to agroup serves as a representative, and when applying to participate inthe service, the representative registers information regarding otherparticipants in the group, such as families or friends who travel withthe representative. Content distribution unit 307 issues a group ID thatidentifies the registered group.

If there is identification information that is used in an existingservice, it may be used as the group ID. In the present variation, thefunctional configuration illustrated in FIG. 11 is realized.Distribution server apparatus 30 b notifies participant's terminal 20 bof the group ID when notifying it of a participation call or the startof a travel. Terminal information transmission unit 205 of participant'sterminal 20 b transmits the notified group ID to distribution serverapparatus 30 b as terminal information.

Content generation unit 304 of distribution server apparatus 30 bgenerates, as content that is to be distributed to participant'sterminal 20 b, content that includes a voice recorded by participant'sterminal 20 b from which the same group ID has been acquired as terminalinformation. As a result, a participant can only hear the voices ofother participants belonging to the same group, and a participant canshare experience only with these participants.

2-6. Plurality of Proxy user's Terminals

There may be a plurality of proxy user's terminals that capture video ata travel destination.

FIG. 12 illustrates an overall configuration of virtual travel system 1c according to the variation. Virtual travel system 1 c includes proxyuser's terminals 10 c-1, 10 c-2, 10 c-3, and so on (hereinafter referredto as “proxy user's terminals 10 c when they are not distinguished fromeach other), instead of proxy user's terminal 10 illustrated in FIG. 1.Every proxy user's terminal 10 c has the same hardware configuration andfunctional configuration as proxy user's terminal 10 illustrated in FIG.1.

FIG. 13 illustrates a functional configuration realized by proxy user'sterminal 10 c according to the present variation. Proxy user's terminal10 c includes terminal position measurement unit 105 in addition to theunits illustrated in FIG. 5. Terminal position measurement unit 105measures the position of the terminal, using technology such as GPS.Terminal position measurement unit 105 repeatedly measures the positionof the terminal while simultaneously supplying position informationindicating the measured position to capture data transmission unit 104.

Capture data transmission unit 104 transmits the supplied dataindicating the position information to distribution server apparatus 30as capture data. Source data acquisition unit 301 of distribution serverapparatus 30 acquires capture data transmitted from proxy user'sterminal 10 c as source data, and thus acquires the position informationindicating the position of proxy user's terminal 10 c indicated by thecapture data.

An operation to select proxy user's terminal 10 c that is to capturevideo that is to be included in the content that is to be distributed isperformed on each participant's terminal 20. This operation may beperformed using input unit 25, or performed through a predeterminedgesture represented by terminal orientation information. Specified rangedetermination unit 203 of participant's terminal 20 transmits rangeinformation indicating the terminal ID of proxy user's terminal 10 cthus selected, to distribution server apparatus 30, in addition to thedetermined specified range.

Specified range acquisition unit 303 receives the transmitted rangeinformation, thereby acquiring the range of video specified in the videoof proxy user's terminal 10 c identified by the terminal ID indicated bythe range information, i.e. the specified range. Video clipping unit 302clips the range of video specified by the acquired range information,i.e. the specified range in the video of proxy user's terminal 10 cselected by the participant. Content generation unit 304 generatescontent data that indicates video thus clipped out, i.e. video clippedout of video captured by proxy user's terminal 10 c selected by theparticipant.

Content data transmission unit 305 transmits the generated content datato participant's terminal 20. As described above, in the presentvariation, content distribution unit 307 distributes content thatincludes video clipped out of the video captured by proxy user'sterminal 10 c selected by a participant, to participant's terminal 20 ofthe participant. Also, while content is being output, a participant maychange a proxy user (proxy user's terminal 10 c) to another proxy userwho captures video that the participant wishes to view. The followingdescribes an example in which such a proxy user change is performedusing a predetermined gesture represented by terminal orientationinformation.

FIG. 14 illustrates an example of a positional relationship betweenproxies. In the example in FIG. 14, participant A is viewing videocaptured by proxy user's terminal 10 c of proxy user α, and an areadefined such that the north-northwest direction from the position of theproxy user α coincides with virtual sight direction B3 is specified.Here, in a case where virtual sight direction B3 when seen from theproxy user α is defined as the forward direction, if an area locatedforward and on the right is defined as area G1, an area located forwardand on the left is defined as area G2, an area located rearward and onthe left is defined as area G3, and an area located rearward and on theright is defined as area G4, proxy user β is located in area G2, andproxy user γ is located in area G4.

Content generation unit 304 determines the positional relationship of aplurality of proxy user's terminals 10 c based on the positions of proxyuser's terminals 10 c indicated by source data (capture data) acquiredtherefrom and the azimuth of the virtual sight direction (virtual sightdirection B3 in this example) indicated by the range informationacquired from participant's terminal 20. In the case of the positionalrelationship illustrated in FIG. 14, content generation unit 304generates content that indicates video illustrated in FIG. 15, forexample.

FIG. 15 illustrates an example of generated video. The example in FIG.15 illustrates video D2 that is displayed when participant A looks up atthe sky. Video D2 is video captured by proxy user α and viewed byparticipant A. In video D2, video area H1 on the upper right side isassociated with area G1 illustrated in FIG. 14, video area H2 on theupper left side is associated with area G2, video area H3 on the lowerleft side is associated with area G3, and video area H4 on the lowerright side is associated with area G4.

Based on the positional relationship illustrated in FIG. 14, contentgeneration unit 304 generates content that indicates video D2 in whichswitching image F1 that indicates switching to video captured by proxyuser β located in area G2 is superimposed on video area H2, andswitching image F2 that indicates switching to video captured by proxyuser γ located in area G4 is superimposed on video area H4. Switchingimages F1 and F2 have the shapes of arrows directed to proxies β and γ(directed to the left and the right), and indicate the left or the rightdirection in which each proxy user is present.

When generating content that includes switching images as illustrated inFIG. 15, content generation unit 304 supplies the terminal IDs of proxyuser's terminals 10 c indicated by the switching images and thedirections indicated by the switching images to switching instructionacceptance unit 306. It is assumed that terminal orientation informationthat instructs to change the orientation of the terminal to thedirection indicated by a switching image is transmitted to distributionserver apparatus 30 from participant's terminal 20 to which the contenthas been distributed.

In such a case, switching instruction acceptance unit 306 accepts theterminal orientation information as a switching instruction thatinstructs to switch from the video displayed on that participant'sterminal 20 to video that is to be clipped out of the video captured byproxy user's terminal 10 c of the proxy user indicated by the switchingimage. In the example in FIG. 15, if terminal orientation informationthat instructs to change the terminal orientation to the left (or theright) is transmitted, switching instruction acceptance unit 306 acceptsthe terminal orientation information as a switching instruction thatinstructs to switch to the video that is to be clipped out of the videocaptured by proxy user's terminal 10 c of proxy user β (or proxy userγ).

Although the switching images in the example in FIG. 15 have the shapesof arrows directed to the left and the right, a switching image may havethe shape of a diagonal arrow as illustrated in the example in FIG. 10.In the example in FIG. 14, if virtual sight direction B3 is defined asthe forward direction, proxy user β is located forward of and to theleft of proxy user α. Therefore, switching image F1 has the shape of anarrow that diagonally points to the top left of the screen. Thus, it ispossible to more accurately indicate the directions of other proxies.

Upon accepting a switching instruction, switching instruction acceptanceunit 306 supplies the terminal ID of switching destination proxy user'sterminal 10 c to video clipping unit 302. Video clipping unit 302 clipsthe specified area out of the video captured by proxy user's terminal 10c indicated by the supplied terminal ID. Thus, content that includesvideo captured by the switching destination proxy user is distributed.

Locations in which a plurality of proxies capture video may be differentlocations in the same tourist spot. (e.g. a location near the entrance,a location in the center, and a location near the exit of a park) orlocations in different tourist spots. In any case, a participant canspecify video that he/she wishes to view from a wide range of videocompared to when there is only one proxy user. Note that the method forswitching to video captured by another proxy user is not limited to thismethod.

For example, content generation unit 304 generates content in which thevoices of other participants, who are viewing video captured by proxies,come from directions in which the proxies are present. In the example inFIG. 14, content generation unit 304 generates content in which thevoice of a participant viewing video captured by proxy user β comes fromthe front left, and the voice of a participant viewing video captured byproxy user γ comes from the rear right. In this case, as in theembodiment, participant's terminal 20 of participant A may be orientatedtoward the direction from which a voice comes, and thus switching tovideo captured by another proxy user located in that direction may becarried out.

2-7. Narrowing Down Other Participants (Part 2)

Other participants may be narrowed down based on video captured by whichproxy user's terminal 10 c, among the plurality of aforementioned proxyuser's terminals 10 c, is to be displayed. For example, contentdistribution unit 307 determines participant's terminal 20 that isdisplaying video captured by proxy user's terminal 10 c that is the sameas proxy user's terminal 10 c that has captured the video displayed byswitching source participant's terminal 20, as switching destinationparticipant's terminal 20, and distributes content thereto.

In this case, specified range acquisition unit 303 supplies the terminalID indicating participant's terminal 20 that has transmitted the rangeinformation indicating the same terminal ID to content generation unit304 and switching instruction acceptance unit 306. Content generationunit 304 generates, as content that is to be distributed toparticipant's terminal 20, content that includes a voice recorded byparticipant's terminal 20 that is displaying video captured by proxyuser's terminal 10 c that is the same as proxy user's terminal 10 c thathas captured the video displayed by the aforementioned participant'sterminal 20.

As a result, a participant can only hear the voices of participants whoare viewing the video captured by the same proxy user as the proxy userof the participant. Thus, participants whose voices can be heard arenarrowed down. Therefore, switching instruction acceptance unit 306accepts a switching instruction upon receiving terminal orientationinformation indicating that the orientation of participant's terminal 20is to be changed to the direction from which the voices of theparticipants thus narrowed down come from.

According to the present variation, content that is to be distributed toa participant can be switched only to content that is to be distributedto other participants who are viewing video captured by the same proxyuser as the proxy user of the participant. Note that the participantsare not necessarily narrowed down to those who are viewing videocaptured by the same proxy user, and may be narrowed down to those whoare viewing video captured by proxies located near each other. In thepresent variation, the functions realized by proxy user's terminal 10 cillustrated in FIG. 13 are used. In the present variation, positioninformation acquired by source data acquisition unit 301 is informationindicating a position where the video indicated by the acquired sourcedata was captured.

This position information is an example of “capture positioninformation”, and source data acquisition unit 301 that captured theposition information is an example of “a capture position acquisitionunit”. Content generation unit 304 generates, as content that is to bedistributed to participant's terminal 20, content that includes a voicerecorded by participant's terminal 20 that is displaying video capturedby proxy user's terminals 10 c from which position informationindicating a position at a distance within a predetermined range toproxy user's terminal 10 c that has captured video displayed by theaforementioned participant's terminal 20 (including proxy user'sterminal 10 c that has captured video displayed by the aforementionedparticipant's terminal 20).

For example, participant A is viewing video captured by proxy user α,participant B is viewing video captured by proxy user β, and participantC is viewing video captured by proxy user γ. It is also assumed thatproxies α and β are capturing video in the same park (the distancebetween them is within the predetermined range), and proxy user γ iscapturing video in another town (the distance to proxy user α is out ofthe predetermined range).

In such a case, content generation unit 304 generates, as content thatis to be distributed to participant's terminal 20 of participant A,content that includes the voice recorded by participant's terminal 20 ofparticipant B that is displaying video captured by proxy user's terminal10 c of proxy user β, but does not include the voice recorded byparticipant's terminal 20 of participant C. As a result, participant Acan only hear the voice of participant B who is viewing the videocaptured by the proxy user who is at a distance within the predeterminedranged to the proxy user who has captured the video that participant Ais viewing.

As a result of the above-described content data being transmitted, whenthere is a proxy user's terminal 10 c from which position informationindicating a position that is at a distance within the predeterminedrange to proxy user's terminal 10 c that has captured video that isdisplayed by switching source participant's terminal 20 has beenacquired, content distribution unit 307 determines participant'sterminal 20 that is displaying the video captured by that proxy user'sterminal 10 c as switching destination participant's terminal 20, anddistributes content. As a result, content that is to be distributed toparticipants can be switched only to content that is to be distributedto participants who are viewing video captured by proxies who arelocated at a distance within a predetermined range to the proxy user(including the proxy user himself/herself) who captured the video thatis being viewed by the aforementioned participant.

2-8. Still Image Capturing

Although images displayed on participant's terminal 20 are movingimages, a favorite picture may be captured and stored as a still image.

FIG. 16 illustrates a functional configuration realized by participant'sterminal 20 d according to the present variation. Participant's terminal20 d includes capture operation acceptance unit 206 and still imagestorage unit 207 in addition to the units illustrated in FIG. 5.

Capture operation acceptance unit 206 accepts a capture operationperformed by a participant. Capture operation is an operation to capturean image like a picture from video that is being displayed. A captureoperation may be performed using input unit 25, or performed through apredetermined gesture represented by terminal orientation information.Upon accepting a capture operation, capture operation acceptance unit206 notifies the acceptance of still image storage unit 207.

Upon being notified of the acceptance of the capture operation, stillimage storage unit 207 captures an image from the content output fromcontent output unit 204, and stores the image as a still image. In thisway, a participant can capture a scene that they like at a traveldestination, and store and enjoy an image of that scene in the same waythey would a photograph.

2-9. Participant's Terminal

In the embodiment, a head mounted display is used as a participant'sterminal. However, a participant's terminal is not limited to a headmounted display. For example, a smartphone, a tablet terminal, apersonal computer, or the like may be used as a participant's terminal.That is to say, any apparatus that can output distributed content (thatcan at least display video) may be used as a participant's terminal.

2-10. Apparatuses that Realize Units

Apparatuses that realize the functions illustrated in FIG. 5 and so onare not limited to the apparatus illustrated in the figures. Forexample, specified range determination unit 203 included inparticipant's terminal 20 may be realized by distribution serverapparatus 30. Also, although switching instruction acceptance unit 306accepts both a switching instruction and a return instruction, theseinstructions may be accepted by separate functions. In addition, thefunctions of distribution server apparatus 30 may be separately realizedby two or more apparatuses. That is to say, the virtual travel systemmay include any number of apparatuses as long as the functions can berealized in the virtual travel system overall.

2-11. System

Although above each example describes a system for providing a virtualtravel service (a virtual travel system), the present invention isapplicable to another system. For example, the present invention may beapplied to a system for virtually viewing musical performance, a sportsgame, an art exhibition, and so on. Also, the present invention may beapplied to a system that distributes content that includes videocaptured by a robot or a drone. That is to say, the present inventionmay be applied to any system as long as it allows each of a plurality ofusers to view a range of video specified by the user out of large videosuch as 360-degree video or a plurality of pieces of video.

2-12. Categories of the Invention

The present invention may be understood as information processingapparatuses such as the proxy user's terminals, the participant'sterminals, and the distribution server apparatus, as well as aninformation processing system, such as the virtual travel system thatincludes these apparatuses. The present invention can also be understoodas an information processing method for implementing the processingexecuted by the respective apparatuses, as well as a program for causinga computer that controls the respective apparatuses to function. Theprogram may be provided by being stored in a recording medium such as anoptical disk or the like, or may be provided by being downloaded to acomputer over a network such as the Internet and being installed so asto be usable on that computer.

2-13. Processing Sequences, Etc.

The processing sequences, procedures, flowcharts, and the like of theembodiments described in the specification may be carried out indifferent orders as long as doing so does not create conflict. Forexample, the methods described in the specification present the elementsof a variety of steps in an exemplary order, and the order is notlimited to the specific order presented here.

2-14. Handling of Input/Output Information, Etc.

Information and the like that has been input/output may be saved in aspecific location (e.g., memory), or may be managed using a managementtable. The information and the like that has been input/output can beoverwritten, updated, or added to. Information and the like that hasbeen output may be deleted. Information and the like that has been inputmay be transmitted to other apparatuses.

2-15. Software

Regardless of whether software is referred to as software, firmware,middleware, microcode, hardware description language, or by anothername, “software” should be interpreted broadly as meaning commands,command sets, code, code segments, program code, programs, sub programs,software modules, applications, software applications, softwarepackages, routines, subroutines, objects, executable files, executionthreads, sequences, functions, and so on.

Additionally, software, commands, and so on may be exchanged over atransmission medium. For example, when software is transmitted from awebsite, a server, or another remote source using hardwired technologiessuch as coaxial cable, fiber optic cable, twisted pair cabling, ordigital subscriber line (DSL), and/or wireless technologies such asinfrared light, radio waves, or microwaves, these hardwired technologiesand/or wireless technologies are included in the definition of“transmission medium”.

2-16. Information and Signals

The information, signals, and so on described in the specification maybe realized using any of a variety of different techniques. For example,data, instructions, commands, information, signals, bits, symbols,chips, and so on that may be referred to throughout all of the foregoingdescriptions may be realized by voltages, currents, electromagneticwaves, magnetic fields or magnetic particles, photo fields or photons,or any desired combination thereof

2-17. Systems and Networks

The terms “system” and “network” used in the specification can be usedinterchangeably.

2-18. Meaning of “Based on”

The phrase “based on” used in the specification does not mean “basedonly on” unless specifically mentioned. In other words, the phrase“based on” means both “based only on” and “based at least on”.

2-19. “And” and “Or”

In the specification, with respect to configurations that can berealized both as “A and B” and “A or B”, a configuration described usingone of these phrases may be used as a configuration described by theother of these phrases. For example, if the phrase “A and B” is used, “Aor B” may be used as long as implementation is possible withoutconflicting with the other phrase.

2-20. Variations, Etc. on Embodiments

The embodiments described in the specification may be used alone, may becombined, or may be switched according to how the invention is to becarried out. Additionally, notifications of predetermined information(e.g., a notification that “X is true”) are not limited to explicitnotifications, and may be carried out implicitly (e.g., the notificationof the predetermined information is not carried out).

Although the foregoing has described the present invention in detail, itwill be clear to one skilled in the art that the present invention isnot intended to be limited to the embodiments described in thespecification. The present invention may be carried out in modified andaltered forms without departing from the essential spirit and scope ofthe present invention set forth in the appended scope of patent claims.As such, the descriptions in the specification are provided fordescriptive purposes only, and are not intended to limit the presentinvention in any way.

REFERENCE SIGNS LIST

1 . . . Virtual travel system, 10 . . . Proxy user's terminal, 20 . . .Participant's terminal, 30 . . . Distribution server apparatus, 101 . .. Video capturing unit, 102 . . . Ambient sound recording unit, 103 . .. Camera orientation measurement unit, 104 . . . Capture datatransmission unit, 105 . . . Terminal position measurement unit, 201 . .. Voice recording unit, 202 . . . Terminal orientation measurement, 203. . . Specified range determination unit, 204 . . . Content output unit,205 . . . Terminal information transmission unit, 206 . . . Captureoperation acceptance unit, 207 . . . Still image storage unit, 301 . . .Source data acquisition unit, 302 . . . Video clipping unit, 303 . . .Specified range acquisition unit, 304 . . . Content generation unit, 305. . . Content data transmission unit, 306 . . . Switching instructionacceptance unit, 307 . . . Content distribution unit, 308 . . . Contentaccumulation unit, 309 . . . Terminal information acquisition unit

What is claimed is:
 1. An information processing apparatus comprising:an image acquisition unit that acquires a captured image; and adistribution unit that, upon receiving a range of image specified inassociation with a first display apparatus of a first user, clips afirst partial image out of the captured image, in the range from theacquired captured image and distributes the clipped first partial imageto the first display apparatus, and upon a display switching instructionbeing made in association with the first display apparatus, distributesa second partial image clipped out of the captured image in a rangespecified in association with a second display apparatus of a seconduser, to the first display apparatus; and a transmission unit thattransmits data indicative of a direction corresponding to the seconddisplay apparatus, to the first display apparatus; wherein the firstdisplay apparatus includes an output unit that outputs the directionindicated by the data in a manner that allows the first user to discernthe direction, wherein the distribution unit distributes, in response toan instruction to change an orientation of the first display apparatusto the direction corresponding to the second display apparatus, thesecond partial image clipped in a range specified in association withthe second display apparatus, to the first display apparatus.
 2. Theinformation processing apparatus according to claim 1, furthercomprising: an accumulation unit that accumulates the acquired capturedimage including the second partial image clipped in the past accordingto the range specified in association with the second display apparatus;wherein the distribution unit distributes the second partial imageclipped in the past, from the accumulated image, to the first displayapparatus.
 3. The information processing apparatus according to claim 1,further comprising: a sound acquisition unit configured to acquire asound that reaches the second display apparatus by acquiring voice datatransmitted from the second display apparatus; and wherein thetransmission unit that transmits voice data that indicates a directioncorresponding to the second display apparatus, and the acquired sound,to the first display apparatus; and wherein the first display apparatusincludes a speaker that outputs configured to output the sound indicatedby the voice data transmitted by the transmission unit, as sound thatcomes from the direction indicated by the voice data; and upon aninstruction being made to change an orientation of the first displayapparatus to the direction indicated by the voice data transmitted tothe first display apparatus, the distribution unit distributes thesecond partial image clipped according to the range specified inassociation with the second display apparatus, to the first displayapparatus.
 4. The information processing apparatus according to claim 1,wherein: the transmission unit that transmits a direction instructingimage that indicates a direction corresponding to the second displayapparatus, to the first display apparatus; and when an instruction ismade to change an orientation of the first display apparatus to thedirection indicated by the direction instructing image thus transmitted,the distribution unit distributes the second partial image clippedaccording to the range specified in association with the second displayapparatus, to the first display apparatus.
 5. The information processingapparatus according to claim 1, further comprising: a positionacquisition unit that acquires position information indicating at leastone position of one or more display apparatuses; wherein thedistribution unit determines another display apparatus from among theone or more display apparatuses, from which the position informationindicating a position that is at a distance within a predetermined rangeto the first display apparatus has been acquired, as the second displayapparatus to distribute the second partial image.
 6. The informationprocessing apparatus according to claim 1, further comprising: alanguage acquisition unit that acquires language information indicatingat least one language that is used in one or more display apparatuses;wherein the distribution unit determines another display apparatus fromamong the one or more display apparatuses, from which the languageinformation indicating a language that is the same as a language used inthe first display apparatus has been acquired, as the second displayapparatus to distribute the second partial image.
 7. The informationprocessing apparatus according to claim 1, further comprising: a timezone acquisition unit that acquires time zone information indicating atleast one standard time zone used at a position of one or more displayapparatuses; wherein the distribution unit determines another displayapparatus from among the one or more display apparatuses, of which astandard time zone indicated by the time zone information acquiredtherefrom has a time difference smaller than a threshold value whencompared to another standard time zone indicated by the time zoneinformation acquired from the first display apparatus, as the seconddisplay apparatus to distribute the second partial image.
 8. Theinformation processing apparatus according to claim 1, furthercomprising: a plurality of capturing apparatuses that capture images;wherein the distribution unit determines another display apparatus thatis displaying an image captured by a capturing apparatus that is thesame as a capturing apparatus that has captured an image displayed bythe first display apparatus, as the second display apparatus todistribute the second partial image.
 9. The information processingapparatus according to claim 1, further comprising: a plurality ofcapturing apparatuses that capture images; and a capture positionacquisition unit that acquires capture position information indicatingpositions of the plurality of capturing apparatuses; wherein when theplurality of capturing apparatuses include another capturing apparatusfrom which the capture position information indicating a position at adistance within a predetermined range to a capturing apparatus that hascaptured image displayed by the first display apparatus has beenacquired, the distribution unit determines another display apparatusthat is displaying the image captured by the other capturing apparatusas the second display apparatus to distribute the second partial image.